Word repetitions in Japanese spontaneous speech
نویسندگان
چکیده
This paper examines several hypotheses based on a ‘strategic’ view of word repetitions in English. We test whether these hypotheses also apply to Japanese with its fundamentally different syntax. Analyses of 10 task-oriented Japanese dialogues reveal two effects. First, pauses are more frequent before and just after a word at a suspension of the speech than after a repetition of that word. Second, the first token of the repeated word is abnormally prolonged. These results support the ‘strategic’ view of repetitions. Speakers often suspend speaking after making a preliminary commitment to a constituent, but they prefer to produce that constituent with a continuous delivery. These findings suggest the generality of these strategies across languages.
منابع مشابه
Automatic Detection and Removal of Disfluencies from Spontaneous Speech
Unlike rehearsed and prepared speech, spontaneous speech contains high occurrence of disfluencies, like repetitions, filled pauses, and hesitations. Disfluencies can seriously hamper the word recognition accuracy of an Automatic Speech Recogniser (ASR), by increasing word insertion and deletion and rejection rates. In this paper we introduce signal processing algorithms to automatically identif...
متن کاملWord Level Timing in Spontaneous Japanese Speech
This study provides evidence against the hypothesis that Japanese has word level mora-timing. Unlike previous studies which used careful speech, this paper evaluates timing in a corpus of spontaneous Japanese speech from 11 speakers. Correlations between word duration and number of moras in the word are shown to be much lower than in careful speech studies. Furthermore, if there were durational...
متن کاملCoping with disfluencies in spontaneous speech recognition
Nowadays, automatic speech recognizers have become quite good in recognizing well prepared fluent speech (e.g. news readings). However, the recognition of spontaneous speech is still problematic. Some important reasons for this are that spontaneous speech is usually less articulated and contains a lot of disfluencies. In this paper, a new methodology for coping with disfluencies is presented an...
متن کاملBenefits of Disfluency Detection in Spontaneous Speech Recognition
Nowadays, automatic speech recognizers have become quite good in recognizing well prepared fluent speech (e.g. news readings). However, the recognition of spontaneous speech is still problematic. Some reasons for this are that spontaneous speech is usually less articulated and that it can contain a lot of disfluencies such as filled pauses (FPs), abbreviatons, repetitions, etc. In this paper, a...
متن کاملWord-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application
In Japanese, the syntactic structure of a sentence is generally represented by the relationship between phrasal units, bunsetsus in Japanese, based on a dependency grammar. In many cases, the syntactic structure of a bunsetsu is not considered in syntactic structure annotation. This paper gives the criteria and definitions of dependency relationships between words in a bunsetsu and their applic...
متن کامل